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Abstract 


This  paper  deals  with  the  problem  of  select¬ 
ing  the  population  associated  with  the  larg¬ 
est  unknown  mean  from  several  normal  popula¬ 
tions  having  a  common  unknown  coefficient  of 
variation.  Both  subset  selection  and 
indifference  zone  approaches  are  studied. 
Based  on  the  observed  sample  means  and 
sample  standard  deviations,  a  subset  selec¬ 
tion  rule  is  proposed.  Some  properties 
related  to  this  selection  rule  are  discussed. 
For  the  indifference  zone  approach,  a  two- 
stage  elimination  type  selection  rule  is 
considered.  If  the  experimenter  has  some 
prior  knowledge  about  an  upper  bound  on  the 
unknown  means,  a  modification  is  introduced 
to  reduce  the  size  of  the  selected  subset  at 
the  first  stage  and  also  to  reduce  the 
sample  size  at  the  second  stage.  An 
example  is  provided  which  indicates  that  the 
saving  of  total  sample  size  is  quite  signi¬ 
ficant  if  this  prior  knowledge  is  taken  into 
consideration  in  designing  the  selection 
rule.  It  is  shown  how  to  implement  the 
above  selection  rules  by  using  several 
existing  tables. 
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1.  Introduction 


The  problem  of  selecting  the  population  associated  with 
the  largest  unknown  mean  has  been  extensively  studied 
in  the  literature  for  normal  populations.  References 
may  be  found  in  Gupta  and  Panchapakesan  (1979)  .  These 
procedures  are  not  applicable  when  the  population 
standard  deviations  are  proportional  to  the  population 
means,  a  situation  that  is  quite  common  in  physical  and 
biological  applications.  In  the  literature,  among 
others,  Gleser  and  Healy  (1976)  have  discussed  several 
best  asymptotically  normal  estimators  of  a  normal  mean 
with  a  known  coefficient  of  variation.  In  the  area  of 
ranking  and  selection,  Tamhane  (1978)  used  estimators 
developed  by  Gleser  and  Healy  (1976)  and  proposed  selec¬ 
tion  rules  (through  both  subset  selection  and  indiffer¬ 
ence  zone  approach)  for  normal  populations  having  a 
common  known  coefficient  of  variation,  and  provided 
tables  for  implementing  the  rules  in  the  large  sample 
case.  Gupta  and  Singh  (1983)  proposed  a  subset  selec¬ 
tion  rule,  based  on  sample  variances,  for  selecting  the 
population  associated  with  the  largest  mean.  Their 
selection  rule  is  independent  of  the  value  of  the  common 
coefficient  of  variation,  and  therefore,  can  be  applied 
to  the  situation  when  the  value  of  common  coefficient  of 
variation  is  unknown. 

In  this  paper,  the  problem  of  selecting  the  population 
with  the  largest  mean  from  several  normal  populations 
having  a  common  unknown  coefficient  of  variation  is 
studied.  The  statistical  formulation  of  the  problem  is 
given  in  Section  2.  In  Section  3,  a  subset  selection 
rule,  based  on  the  observed  values  of  the  sample  means 
and  sample  standard  deviations,  is  proposed.  Some 
properties  related  to  this  selection  rule  are  also 
discussed.  In  Section  4,  a  two-stage  elimination  type 
rule  is  given  based  on  the  indifference  zone 
approach.  If  the  experimenter  has  some  prior  knowledge 
about  an  upper  bound  on  the  unknown  means,  a  modifica¬ 
tion  is  introduced  which  reduces  the  sample  size  at  the 
second  stage.  To  implement  the  above  selection  rules, 
the  related  tables  are  available  from  Gupta  (1963),  and 
Gupta,  Panchapakesan  and  Sohn  (1985). 
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2.  Formulation  of  the  Selection  Problems 

Let  n^.,.,11^  be  k  (^2)  normal  populations  with 
positive  means  0^,...,e^  and  a  common  unknown  coeffi¬ 
cient  of  variation  b .  Let  6  be  the 

ordered  values  of  the  means.  We  shall  assume  that  the 
experimenter  has  no  prior  knowledge  concerning  the 
correct  pairing  between  v ^  and  6^^  (1  <  i,  j  <  k) . 

The  population  corresponding  to  0  ^  j  will  be  referred 

as  the  best  population.  In  the  .following,  we  will  let 
o  =  (o.,...,0.)  be  the  parameter  vector  and  n  be  the 

^  X  K 

parameter  space. 


Subset  Selection  Approach 

According  to  this  approach,  the  goal  of  the  experi¬ 
menter  is  to  select  a  (small)  subset  of  populations 
which  contains  the  best  population.  The  selection  of 
any  subset  containing  the  best  population  is  called  a 
correct  selection  (CS) .  The  decision-maker  restricts 
attention  only  to  those  rules  which  guarantee  the 
probability  requirement  (the  so-called  P*-condition) 
that 


P  fCS}  >  P*  for  all  3  €  3, 

O  —  •v 


where  P*,  k~  <  P*  <  1,  is  a  preassigned  constant. 


(2.1) 


Indifference  Zone  Approach 

According  to  this  approach,  the  goal  of  the  experiment¬ 
er  is  to  select  the  best  population.  The  selection  of 
the  best  population  is  called  the  correct  selection 
(CS) .  For  this  approach,  in  order  to  specify  the 
probability  requirement,  it  is  first  necessary  to 
define  a  measure  of  distance  between  two  populations. 
We  consider  the  following  measure  of  distance: 

j»°j)  =  ci~°j'  t^°  difference  between  the  two  popu¬ 
lation  moans  of  and  it ^ .  Note  that  this  measure  of 
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distance  is  different  from  that  considered  by  Tamhane 

(1978).  For  a  preassigned  value  6*  >  0,  let 

fi(<5*)  -  1  6  IJc-l]  +  6*).  B(6*)  is  known  as 

the  preference  zone  and  its  complement  as  indifference 
zone.  The  assignment  of  £*  value  will  be  based  on  the 
experimenter's  prior  knowledge.  The  experimenter 
restricts  consideration  only  to  those  rules  which 
guarantee  the  probability  requirement  that 


P  (CS)  >,  P*  for  all  £  €  n(6*) 


(2.2) 


where  P*,  k  <  P*  <  1,  is  a  preassigned  constant. 


3.  Subset  Selection  Approach 


The  goal  here  is  to  select,  on  the  basis  of  an  indepen¬ 
dent  random  sample  X^,  j  ■  l,...,n,  from  each  tt i 

(1  <  i  <  k),  a  subset  containing  the  best  population 
with  the  minimum  probability  P*  of  a  correct 
selection. 


Subset  Selection  Rule  R. 


For  each 


i,  let  XA  =  i  j|iXij  and  s\  =  ^  ?  <Xi:j-Xt>2 


Based  on  (X^,S^) ,  1  <  i  <  k,  we  propose  a  subset  selec¬ 


tion  rule  R^  as  follows: 


R^:  Select 


it.  if  X.  >  X.  -  cS. 
l  i  —  1  /  n  l 


for  j#i.  (3.1) 


The  constant  c  >  0  is  determined  by 


P{Zj  cW  for  1  <_  j  k-1)  =  P* 


(3.2) 


where 


.•a*.*. 

*  .  •  mf  m  .  < 


'.V.N'V 


Zj  *V*  N (0, 1)  ,  1  <  j  <  k-ly 

cov(Zi#Zj)  =  1/2,  for  i  +  j; 

2  2  (3.3) 

(n-1) 'u  x  (n-1);  and 

(Z. , . . .  ,Z.  , )  and  W  are  independent. 


The  value  of  c  satisfying  (3.2)  is  available  from  Gupta 
and  Sobel  (1957),  and  Gupta,  Panchapakesan  and  Sohn 
(1985). 

Probability  of  a  Correct  Selection  Associated  with  R^ 

Let  Pe(CS|R}  be  the  probability  of  correct  selection 

when  9  is  the  true  parameter  vector  and  the  selection 
rule  R  is  applied.  Then,  we  have 


Theorem  3.1.  P„ { CS ! R, }  >  P*  for  all  0  €  n. 

-  9  1  1  -  -v 

Proof:  Without  loss  of  generality,  we  can  assume  that 
0k  =  Following  a  straightforward  computation. 


P, { CS | R. } 

% 


“  pfi(Xv  1  x.  -  /2/n  cS.  for  j  $  k) 


&  k 


(3.4) 


=  P0{Y-  —  “-i  +  /2c8.(9)Wf  for  1  <_  j  <_  k-1} 

ft.  J  J  J  ^ 


where 


Yj  =  ^(Xj-Xj^-e  .+ek)b"1(e^+e^)"=  ^  n(0,1),  l±jik-l 

cov(Yi,Yj)  =  ojnej+o2)(e2+02,i-i  =  Pij(9),  i  /  j 
w  =  Sk/(bOk) , (n-l)W2  %  x2(n-l)  and  (3.5) 


(Y^ , . . . , Yk_^)  and  W  are  independent; 


and 


Note  that  aj(g)  1  0,  8..  (9)  >.  1//2  and  pi;j(£)  1  \  since 

®jc  —  ej  f°r  j  1*  k.  Then  by  Slepian's  inequality,  we 

conclude  that  the  smallest  value  of  the  constant  c  is 
given  by 

P^{CS  |  R^ }  1  p{Zj  <  cW  for  1  £  j  £  k-1 }  *  P*  for 

$  *  a, 

where  (1  <  j  <  k-1)  are  defined  in  (3.3). 

Least  Favorable  Configuration 

Let  »  {0|e^  *•••■  >  0).  For  each  e  €  n, 

0k  *  e being  fixed,  both  a j (£)  and  e j  (e)  are  decreas¬ 
ing  in  0.  and  a.  (9)  =  0,  8 -(e)  =  1//?  when  0.  =  0,  . 

J  J  °  J  ^  j  K 

Then  by  Slepian's  inequality,  we  can  see  that 

inf  P.{CS|R.}  =  inf  P_{CSlR.}  =  Pn  {CS|R.} 
e€fi  ®  1  9€fln  K  1  §0  1 

for  any  9g  6  Note  that  PQ  (CSjR.)  does  not  depend 

•^0 

on  0-. 

Some  Properties  of  R^ 

Property  1.  For  fixed  0  t  Si,  P0{CS|R^}  is  decreasing 
in  b  for  b  >  0. 

This  is  obvious  from  (3.4)  since  a ^  (o)  _>  0  and  the 
distributions  of  (Y^, . . . and  W  are  independent 


7 


of  the  parameter  b. 

For  fixed  b,  suppose  that  (e^,...,8k)  ■ 

(acj^  +  d^,...,ack  +  dk)  ,  where  ci  >  0,  a,  di  ^  0  and 

ac.  +  d.  >  0  for  each  i  =  l,...,k.  Also,  we  assume 
ii 

that  _>  Cj,  dk  _>  dj  for  j  /  k.  Thus,  8k  _>  8j  for 
j  /  k.  Then, 


Vcs|Ri} 

% 


=  p  „{Y.  <  /n  g.(a)b-1  +  /2ch. (a)W  for  j  #  k}. 


(3.7) 


0  1- 


where  g4  (a)  =  [a  (c,,-c^ )  +  (du-d.. )  ]  [  (ac^+d^)  +(ac^+d^)  ]  2; 


'k  ,"j#  "*k  j 


3  3 


2  2  —4 

h ^  (a)  =*  (ack+dk)  [  (ack+dk)  -MaCj+dj)  ) 


Lemma  3.1.  The  following  three  statements  are 
equivalent. 


ii)  gj (a)  is  increasing  in  a  for  a  >  0. 
iii)  hj(a)  is  increasing  in  a  for  a  >  0. 

From  (3.7)  and  Lemma  3.1,  we  have  the  following  results. 


Property  2.  Let  b  be  a  fixed  constant,  then 

d.  d.  , 

a)  if  —  j>  — 2.  for  all  j  #  k,  then  P  {CS|R^}  is 
ck  cj  " 

decreasing  in  a;  and 


b)  if  1  for  all  j  #  k,  then  P.iCSlR. >  is 

ck  ”*  cj  S  1 

increasing  in  a. 

The  following  two  situations  are  special  cases  of 
Property  2. 

c)  When  c^  *=  c2  «*...=  c^  and  d^,...,d^  are  fixed 
constants,  then  P  {CS|R,}  is  decreasing  in  a. 

a-  A 

d)  When  d^  =  ...«  d^  =  0  and  c^,...,c^  are  fixed 
constants,  then  P0{CS]R^}  is  a  constant,  which  is 
independent  of  the^parameter  a. 

Property  3.  Monotonicity  of  R^ 

Let  PQ{ni|R1}  be  the  probability  of  including  it.  in  the 

selected  subset  when  rule  R^  is  applied  and  e  is  the 
true  parameter  vector.  Then, 

Theorem  3.2.  If  0.  >  6.,  then  P,  {ir.|R,  }  >  P„{w.|R,  }. 
-  x  —  i  o  i  1  1  —  e  n 1  1 

Proof:  Without  loss  of  generality,  let  e2  e^,  and 

we  will  prove  that  Pq{h2|r1}  _>  I  •  Straight¬ 

forward  computations’'' show  that  ^ 

V'i"V 

l  Yij  1  »/n(oi-0:j)b"1(e^+e j)”^  +  /2coi(0^+9j)"'4JW  and! 


Y.  <  ^le.-oJb’^eJ+eVH/Tca.  (o?+9^)"^w  for 
im—  im  im  rim  i 

3  <  m  <  k 


for  i,j  =  1,2,  i  #  j,  where  cov(Yim,Yit)  - 

e?t  te?+e^)  (0^+e^)  ]“s,  1  m  <_  k,  i  +  m,  t  *  i,  m  /  i. 

We  then  conclude  the  proof  of  this  theorem  by  an 
application  of  Slepian's  inequality  and  by  noting  the 
following  facts: 

2  2  —  4 

i)  (o-e  ) (0  +8  )  a  is  increasing  in  0  for  0  >  0  for 
mm 

each  m; 

2  2  -4 

ii)  6(0+0^)  2  is  increasing  in  0  for  0  >  0  for  each 
m;  and 

iii)  cov(Y2m'  -  cov *Ylm' Yls.)  for  3  -  m»  1  -  k' 

m  ^  ii  and  cov(Y21,  Y2m)  _>  cov(Y12,  Y1r|)  for 

3  <  m  <  k. 


Expected  Size  of  Selected  Subset 

Note  the  selection  rule  R^  selects  a  non-empty  subset, 

the  size  of  which  is  not  fixed  in  advance  but  depends 
on  the  outcome  of  the  experiment..  Hence,  as  a  measure 
of  performance  of  the  selection  rule  R^,  we  can  con¬ 
sider  the  expected  size  of  the  selected  subset,  say 
(S | R^) .  We  have  the  following  expression; 

VslRi'  -  .IvilV 

^  1=1  -V 

(3.8) 

k  ^  i  _  *i  —  i 

=  y  Pn(V.  >  V.o.e”  +/nb  (e.0.  -l)-/2cW  for 
it1  e  i  -  3  j  i  3  i 

j  i*  i>, 

where  \r,  1  <  i  <  k,  are  iid  having  standard  normal 

distribution  and  W  is  as  defined  in  (3.3).  It  is  often 
of  interest  to  identify  the  parameter  configuration 


where  the  supremum  of  (3.8)  occurs.  For  k  =  2,  let 
02  =  o  and  =  ao,  0  <  A  <_  1.  Then, 

eo(S[Ri)  =  /♦ ( /nb_1 (A-l) (1+A2) -^  +  /2Ac (1+A2)  ~^w)dFw(w) 

K  0 

+  /$  (/nb~^  (1-A)  (l+A2)'^+/2c(l+A2)“®w)dFw(w)  , 
0  w 

where  $ ( • )  is  the  standard  normal  distribution  and 

Fr T ( - )  is  the  distribution  of  random  variable  W. 

w 

We  see  that  sup  E  (S  j  R, )  =  2P*  and  the  supremum  occurs 

when  A  =  1.  For  k  >  2,  it  appears  difficult  to  obtain 
such  a  result.  We  can  only  state  that 
sup  E  (S|R, )  _>  sup  E  (S  |  R.  )  =  kP*. 

0  6ft  $  1  9  £ f:  n  £  A 


Remark 

For  the  case  when  b  is  known,  Tamhane  (1978) ,  proposed 
a  subset  selection  rule  for  the  best  population  using 
estimators  of  Gleser  and  Healy  (1976) .  He  also 
provided  tables  for  implementing  the  rule  in  the  large 
sample  case.  However,  for  small  sample  sizes,  tables 
for  implementing  Tamhane ' s  rule  are  not  available.  It 
has  been  pointed  out  by  Tamhane  (1978)  that  for  certain 
values  of  b,  k,  n  and  P#,  Tamhane's  rule  does  not  exist. 
Based  on  sample  variances,  Gupta  and  Singh  (1983)  also 
proposed  a  subset  selection  rule  for  the  problem  of 
selecting  the  best  population.  Their  selection  rule 
and  the  associated  probability  of  correct  selection  are 
independent  of  the  value  of  the  common  coefficient  of 
variation  b,  and  hence,  can  be  applied  to  the  situation 
when  this  value  is  unknown.  They  made  some  comparison 
between  Tamhane's  and  their  rules.  It  is  found  that  in 
terms  of  expected  size  of  selected  subset,  the  perform¬ 
ance  of  Gupta  and  Singh's  rule  is  a  little  inferior  to 
that  of  Tamhane's  rule.  For  k  =  2  and  large  sample 
size,  there  is  not  much  difference  between  these  two 


11 


Note  that  the  proposed  rule  R^  is  independent  of  the 

value  of  the  common  coefficient  of  variation  b.  How¬ 
ever,  from  (3.4),  we  can  see  that  the  associated 
probability  of  a  correct  selection  depends  on  b  for 
each  fixed  0  €  n.  Therefore,  it  is  interesting  to 
compare  the^performance  of  rule  R^  with  Gupta  and 

Singh's  rule.  Further  study  of  this  comparison  is  to 
be  carried  out. 


4.  Indifference  Zone  Approach 

The  goal  here  is  to  derive  selection  rules  which  will 
select  the  best  population  with  a  guaranteed  probabil¬ 
ity  P*.  On  S7(6*),  the  associated  measure  of  distance 
between  populations  and  is  6(0j_,0j)  =  6i“0j* 

which  is  different  from  the  one  considered  by  Tamhane 
(1978).  Since  both  9  and  the  common  coefficient  of 
variation  b  are  unknown,  on  f2(S*),  it  is  impossible  to 
construct  a  single-stage  selection  rule  which  guaran¬ 
tees  the  probability  requirement  of  (2.2).  In  the 
following,  a  two-stage  elimination  type  selection  rule 
is  proposed. 


Two-Stage  Elimination  Type  Selection  Rule  R2 

Stage  1:  Take  nQ  ( _>  2)  independent  observations 
Xij(j  =  l,...,nQ)  from  each  (i  =  l,...,k),  and 

-(1)  1  n° 

compute  the  sample  mean  X^j  and  sample 

n0 

variance  S2  =  — l  (X .  .-X  .(1) )  2 .  Then  determine  the 
1  n0-1  j*l  ^  1 

set 

A  =  {ijxf1*  _>  X^-l/IdS^y/n^  -6*)+  for  j^ib  (4.1) 

where  y+  =  max(y,0)  and  d  is  a  positive  constant  chosen 
to  satisfy  (2.2)  . 


If  A  contains  only  one  element,  then  stop  sampling  and 

assert  that  the  population  associated  with  max  is 

l^j<k  3 

the  best. 


If  A  has  more 
second  stage. 


than  one  element,  then  proceed  to  the 


Stage  2:  Let  S  «  max  S. .  Determine 
A  i€A  1 

N  =  max{nQ, [2 (dSA/«*) 2] *} ,  where  ly] *  denotes  the 
smallest  integer  _>  y.  Take  N-n^  additional  observations 
Xij  from  each  ir^  (i  €  A)  if  necessary.  Then  compute 

1  N 

the  overall  sample  means  X.  *»  r;  I  X.  .  (i  6  A)  and 

1  N  j»l  13 

assert  that  the  population  associated  with  max  X.  is 

i€A  1 

the  best. 


Note:  This  selection  rule  is  essentially  of  the  same 
type  as  that  of  Gupta  and  Kim  (1984)  ,  even  though  we 
have  a  different  type  of  screening  procedure  and 
different  way  to  determine  the  value  N.  The  difference 
is  due  to  the  fact  that  in  this  paper,  the  concerned 
population  standard  deviations  are  proportional  to 
respective  population  means  while  Gupta  and  Kim  (1984) 
considered  a  common  unknown  standard  deviation. 


Probability  of  Correct  Selection 

Without  loss  of  generality,  we  still  assume  that  is 

the  best  population.  Therefore,  0^  ej  +  for  j  i*  k. 

Let  B  be  any  subset  of  (l,2,...,k)  containing  the 
element  k;  and  let  E(B)  be  the  event  that  subset  B  is 
selected  at  the  first  stage.  Also,  let 
C  =  (B  c  { 1, . . . ,k) | k  €  B}.  Then, 


6*)  for  j  +  k}. (4.2) 


U  E (B)  -  {X'*1  >  X?A,-( - - 

B€C  k  ”  3 

Also, 

Pfl ( CS | R- }  *  T  P.{CS  n  E (B) } 

K  1  BSC  ® 

=  l  Pfl{CS|E(B))P_{E(B)}. 

BSC  i  K 

Here  p^{CS|E(b)}  denotes  the  conditional  probability  of 
CS  given  E(B). 

Since  Qk  >.  0^  +  6*  for  j  #  k,  for  each  B  €  C, 
P0{CS|E(B)} 

'Xv 

-  Vxk  i  Xj  for  j  €  B-{ k }  |  E (B)  }  (4.4) 

>  PgtYj  <  /N6*b"1(ej+eJ)"^  for  j  6  B-{k}|E(B)), 

where 

Vj  =  ✓NI (Xj-xk)-(0j-0k)] (b(e j+e^)")"1  %  n (o , l) , 

1  <  j  <  k-l; 

cov ( Y . ,  Y . )  =  p..(e)  which  is  defined  in  (3.5)  for 

i  J  lj  'V 

i  /  j- 

Conditional  on  E(B),  »/N5*  /2dSg  _>  /2dS^.  Then,  it 
follows  that 

P0(CS]E(B) } 

2.  Pe(Y.  <  »/26.(jg)dW  for  j  €  B-  ( k  }  j  E  (B)  }  (4.5) 

1  P  Q ( Y  .  <  /2S.(9)dW  for  1  <  j  <  k-l ( E (B) } 


where  W  =  S./(bo.)  and  B  •  ( 0 )  is  defined  in  (3.6) 


Let  y*1*  -  /n7[(xi1)-x,i1))-(e^-e1,)]  ttxej+e*)4)'1, 

1  <  j  <  k-1 


3  ‘0l4“j  ~k  #  luj  k' 1  j  ■  vk' 


Then,  Yj1UN(0,l),  cov (Y*1} , Y j1* )  =  oij(^),  i  *  j. 
Also,  cov(Y^,y!^)  >  0,  1  ^  i,  j  ^  k-1.  It  should  be 
pointed  out  that  CV1# ...  #Yic— 1^  and  » •  •  •  »Yk-i^  are 


identically  distributed,  and  (Y^, . . . , Y^^)  and 


(Y 


(1> 


..,Yj|^)  are  independent  of  w. 


For  fixed  ek(»  e  ) ,  let  «  (e*,...,e£),  where 


e£  *=  and  e*  -  e^-4*  for  j  /  k.  Then, 


i  j  * 


6S(0)  >  s^(0  ). 

1  %  ^  J  *\f 


(4.6 


k  k 

Note  that  both  p..(0  )  and  8- (9  )  ate  decreasing  in  e. 

lj  'v.  1  ^  k 


pij($k)  -  I*  V$k)  -  as  6 


3 

k 


From  (4.2)  to  (4.6)  together  with  the  facts  that 
>  Bj  +  4*  for  j  #  k,  cov(Yi,Yj1^)  >.  0, 

/n^  4*  +  (/2dSk  -  /n^4*)  +  _>  /2dSk  and  repeated  applica 

tions  of  Slepian's  inequality,  straightforward 
computations  lead  to  the  following  result: 


>  /IP  { v .  <  /2B.(e)dw  for  1  <  j  <  k-l>]2dFw(w) 

~  o  %  3  -  3  %  ~  ~  w 

QO 

>  /IP  t{Y.  <  /2B.(9k)dw  for  1  <  j  <  k-1}] 2dFu(w) 

0  eK  3  3  --  w 

% 

oo  _ 

>  f/p  kfY.  /23H(ek)dw  for  1  <  j  <  k-l}dF  (w)]2 

0  0  J  J 

<B  _ 

£  ( / P { Z  .  £  dw  for  1  <  j  £  k-l}dF  (w)]2’ 

0  J  ” 

=  [ P{ Zj  <  dw  for  1  <  j  <  k-1}]2 

where  (Z^, . . . , 2k_^)  are  defined  in  (3.3),  and  distribu¬ 
ted  independently  of  W. 

For  given  P*,  we  can  choose  the  value  d  so  that 

P{Zj  £  dW  for  1  <  j  <  k-1}  ■  /P*.  (4.8) 

Therefore,  the  probability  requirement  of  (2.2)  will  be 
satisfied.  For  some  specified  values  of  P*,  k  and  n^, 

the  corresponding  d  value  can  be  found  in  Gupta  and 
Sobel  (1957)  and  Gupta,  Panchapakesan  and  Sohn  (1985). 


A  Modified  Two-Stage  Selection  Rule  R2(9*) 

In  practical  situations,  the  experimenter  sometimes  may 
have  some  prior  knowledge  about  an  upper  bound  on  0  ^ , 

say  0*.  This  knowledge  can  be  used  to  reduce  the 
sample  size  taken  at  the  second  stage. 

Let  e*(5*)  =*  (0?,...,0J)  where  et..  »  6*  and 

'v  i  K  |KJ  2  2  2  -1 

0!j]  “  0#“6*  for  5  *  k*  Let  p*  “  0*  (b*  +  (e*-5*)  ) 

and  6*  =»  e*  (e*2  +  (9*-6*)2)"^.  Let  Y*(l  <  i  <  k-1)  be 
standard  normal  random  variables  having  cov(Y*,Y*)  =  p* 
for  i  #  j,  and  W  be  a  random  variable  distributed 
independently  of  (Y*, . . . , Y*_^)  with  (n^-ljw2  having  a 

x2  distribution  with  (n0-l)  degrees  of  freedom. 

Let  d (0*)  be  chosen  to  satisfy 

P{Y*  ^  d(0*)/20*W  for  1  <  j  <  k-1}  *  /P*  .  (4.9) 

Then  a  modified  two-stage  elimination  type  selection 
rule,  say  R2(9*)»  can  be  defined.  This  selection  rule 

R2(o*)  is  similar  to  rule  Rj.  The  only  difference  is 

that  now  the  value  d(0*)  is  used  instead  of  the  value 
d.  We  denote  the  corresponding  N  by  N(0*). 

Following  (4.7),  one  can  see  that  for  each  0  6  fi(6*,0*) 
where  fi(6*,0*)  «  €  n^*)l0[jc]  L 

P0{CS|R2(G*)  } 

_>  [P{Y*  <.  d(G*)/20*W  for  1  <  j  <  k-l)l 2  =  P* . 

Since  o*  >  4  and  3*  >  — t  it  follows  from  Slepian's 
2  /2 

inequality,  (4.8)  and  (4.9)  that  d(0*)  <  d,  and 
hence  N(0*)  <  N.  Also,  if  we  let  A(0*)  denote  the 


subset  selected  at  the  first  stage  by  applying  rule 
I*2  (0*)  ,  that  is. 


_ii\  "^d  ( 0  * )  S .  , 

>(0*)  =  {ilx;  '  >  X4A,-( - — i  -6*)  for 

1  "  3  ^ 


(4.10) 


j  ?*  i># 

then,  since  d(o*)  <  d,  from  (4.1)  and  (4.10),  we  can 
see  that  A(o*)  A  ;  hence,  |A(0*)|  £  |A|  where  ]b| 
denote  the  size  of  set  B. 


Generally  speaking,  the  modified  selection  rule  1*2  (9*) 

reduces  the  si2e  of  the  subset  selected  at  the  first 
stage  and  reduce  the  sample  size  needed  at  the  second 
stage . 


An  Example  to  Illustrate  the  Use  Of  1*2  and  1*2(0*) 

Suppose  that  a  consumer  has  to  decide  on  buying  one  lot 
of  bolts  from  among  five  lots  that  are  available.  The 
tensile  strength  (in  pound  per  square  inch;  psi)  of  the 
ith  (1  £  i  £  5)  lot  is  normally  distributed  with  mean 
0^  and  standard  deviation  b0^,  where  both  b  and  0^  are 

positive  and  unknown.  Suppose  that  <5*  -  200  psi  and 
P*  =  0.90  have  been  specified  by  the  consumer.  Further 
suppose  that  n^  *  16  bolts  are  randomly  sampled  from 

each  of  the  five  lots  and  that  the  sample  means  and  the 
sample  standard  deviations  are: 

(x<1} ,x<1} ,  x<X) ,£*1) ,x*1,)»  (350, 380, 470, 600, 650) , 
(s1,s2#s3,s4,s5)  =  (  360, 420, 500, 58 0,600) . 

Now  /p*  «  0.9486833  »  0.95,  so  from  Table  IV  (p  =  0.5) 
in  Gupta,  Panchapakesan  and  Sohn  (1985),  using  interpo¬ 
lation,  it  is  found  that  d  2.34582.  Then, 

A  =  {3,  4,  5}.  Therefore,  we  proceed  to  the  second 
stage  and  find  that  N  =  100.  Further  additional  84 
observations  are  taken  from  each  of  the  selected  lot 


and  the  sample  mean  is  computed.  Finally,  the 

consumer  selects  the  lot  of  bolts  associated  with  the 
largest  sample  mean  among  X^,  3  <  i  <  5,  as  the  best. 

Suppose  now  that  the  consumer,  from  past  experience, 
has  some  knowledge  that  0^]  ±  1000  »  0*.  Therefore, 

he  prefers  to  apply  the  modified  selection  rule  R2(6*) 

for  his  selection  problem.  Now, 

p*  =  0*2(e*2+(e*-5*)2)_1  =  0.609756; 

6*  =  0*  (0*2+(0*-6*)2)"*  «  0.7808688. 

Let  dx  =  d(0*)/28*.  From  (4.9)  and  Table  IV  (p  =  0.6) 

in  Gupta,  Panchapakesan  and  Sohn  (1985),  using  interpo¬ 
lation,  it  is  found  that  d^  «*  2.30289.  Therefore 

d  ( 0*)  «  2.0853556.  Note  that  p*  »  0.609756  >  0.6.  So 
the  value  d(e*)  obtained  in  this  way  will  be  a  little 
conservative  since  the  exact  value  of  d(0*)  will  be  a 
little  less  than  that  used  here.  We  then  find  that 
A ( 0 *)  =  {4,5}  and  N(0*)  **  79.  Therefore  the  consumer 
needs  to  take  additional  63  observations  from  each  of 
4th  and  5th  lots  to  accomplish  the  selection  process. 
Note  that  the  total  sample  size  by  applying  rule 
R2(e*)  is  206  while  the  total  sample  size  by  applying 

rule  R2  is  332.  The  saving  of  the  total  sample  size  is 

quite  significant. 
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